Abstract - Wireless access through a large distributed network
of low-complexity infrastructure nodes empowered with
cooperation and coordination capabilities is an emerging radio
architecture and a candidate for dealing with the mobile data
capacity crunch. In the 3GPP evolutionary path, this is known as
the Cloud-RAN paradigm for future radio. In such a complex
network, optimizing distributed MIMO resources is of paramount
importance for achieving capacity scaling. In this paper, we
investigate efficient strategies for optimizing the pairing of
access nodes with users, as well as linear precoding designs that
provide a fair QoS experience across the whole network when data
sharing is limited by complexity and overhead constraints. We
propose a method for obtaining the exact optimal spatial resource
allocation, which can be applied in networks of limited scale, as
well as an approximation algorithm with bounded polynomial
complexity, which can be used in larger networks. The latter
algorithm outperforms existing user-oriented clustering techniques
and achieves high quality-of-service levels with reasonable
complexity.

Index Terms - C-RAN, Distributed MIMO, Pairing, Linear
Precoding, MI-SOCP, Fairness

I. Introduction 

With the 4G LTE rollouts gaining momentum all over the
world, the so-called "1000x mobile data challenge" [1] has
initiated intense discussions among the research community
for beyond 4G/2020 next generation wireless networks [2].
A potential solution for dealing with the expected capacity 
crunch is to massively densify low-cost wireless access net- 
work infrastructure and empower it with coordination and 
cooperation mechanisms, following the recently proposed C- 
RAN [3] or "Cloud-of-Antennas" [4] paradigm. Inspired by 
the well-known principles of Network MIMO [5], an ultra- 
dense network could provide linear capacity scaling if the 
number of infrastructure nodes is at least equal to the num- 
ber of access terminals and the spatial degrees of freedom 
are optimally exploited. However, this intelligent spatial re- 
sources optimization does not come for free, since the net- 
work scalability imposes huge requirements on computational 
complexity, access nodes synchronization, and overhead for 
exchanging control and data signaling. Early efforts have
focused on related system requirements and design aspects [6],
implementation issues (at least for small-scale networks) [7],
[8], transceiver and resource management algorithms [9],
[10], as well as overhead reduction techniques [11], [12].

In this work we consider a distributed wireless network
consisting of randomly placed infrastructure nodes, or simply
access points (APs), coordinated by a central entity and serving
randomly placed user terminals (UEs). UEs' data could be
fully or partially combined using linear precoding depending
on the full (global) channel state information. Similarly to the
definition of universal frequency reuse (UFR) in 4G networks,
we introduce the concept of "universal space reuse" (USR) for
Distributed MIMO (D-MIMO) networks, in the sense that each
AP has access to each UE's traffic and is thus a candidate for
transmitting its data. As in UFR, where the reuse of frequency
resources may be limited due to interference issues [13], in
USR space reuse may also be limited due to complexity and
overhead issues. In this context, we seek efficient
joint AP-UE pairing and linear precoding strategies over the
spatial dimension of distributed MIMO setups (that is, on a
single orthogonal resource unit) when data sharing is limited.
From a QoS perspective, we guarantee a fair experience in terms
of achieved data rate.



A. Related Work 

The problems of optimum linear precoding for providing
fair QoS in single-user and multi-user network MIMO systems
have been formulated and exactly solved in [14] and [15]
through an iterative second order cone programming (SOCP)
framework. However, these works assumed full cooperation and
did not consider data sharing overhead limitations. In [16] an
appealing data sharing cost model was introduced, according to
which the overhead was expressed through the number of nodes
having access to each UE's data. However, the authors considered
the sum-rate QoS maximization problem, which may
lead to unfairly assigned data rates, and did not provide a
solution for the exact optimal precoding design, but only a
heuristic algorithm. A similar overhead model was proposed in [17]
for the sum-power minimization problem in cellular networks with
known data rate/SINR requirements, a QoS model more suitable
for constant-rate voice-like services than for variable-rate
data. In addition, the proposed solution approach was based
on concepts from sparse optimization (e.g. [18]), requiring
the calculation of a regularization parameter that is generally
unknown. Finally, in [19] the above framework was extended
to consider integer activity variables for AP-UE pairings, and
a Mixed Integer SOCP (MI-SOCP) formulation was proposed,
also involving the unknown regularization parameter. Related
simpler heuristic algorithms were also proposed.



B. Contribution - Paper Organization 

In this work, inspired by the framework of [19], we first
formulate the fairness-based QoS problem using an MI-SOCP
model as well, but without any regularization parameter, so
that it can be exactly solved by modern mixed integer conic
solvers. Second, we propose and explore different
variations of the data sharing limitation, in which either the
overall number or the per-UE number of AP-UE pairings
is constrained. Third, due to the complexity and lack of
scalability of the exact integer optimization model, we propose
a faster approximation algorithm which iteratively solves a
series of SOCP problems and is demonstrated to achieve quite high
performance under bounded and predictable computational
complexity. This simpler algorithm outperforms baseline schemes
which are based on clustering principles of multi-cell MIMO
networks (e.g. [20]). While the exact MI-SOCP solution may be
used for smaller networks (up to 10x10 on typical computational
platforms) due to its high computational requirements, the
heuristic may be used for realistic larger-scale ultra-dense
networks.

The rest of the paper is structured as follows. In Section II 
system modeling issues along with the work assumptions are 
presented. In Section III the optimal formulation and solution 
through the mixed integer second order cone programming 
framework is given, while in Section IV the proposed faster 
approximation algorithm is explained. In Section V simulation 
results are given while in Section VI the work is concluded 
and future work directions are stated. 

II. System Modeling and Assumptions 

Consider a wireless network comprising $M$ randomly placed
distributed access nodes/points (APs) serving $K$ users (UEs)
in a coordinated way following the C-RAN/Network MIMO
paradigm. A single resource unit (RU) is considered (for
example an LTE-A time-frequency resource block), which all
APs/UEs can access through spatial multiplexing. Extension
to multiple orthogonal resource units is possible by applying
the system model to each RU independently.
The APs have knowledge of the network channel state
information (CSI) through over-the-air feedback links with the
UEs, and full/partial access to UEs' data through backhaul
links to facilitate linear precoding. From a quality-of-service
perspective, we aim at providing a fair UE experience following
the LTE-Advanced service requirements. This is achieved by
guaranteeing a common data rate per UE or, equivalently through
a Shannon-based PHY-abstraction approach [21, Sec. 10.4], a
common signal to interference plus noise ratio (SINR). In this
sense, we seek the maximum common SINR, denoted $t^*$, and
the way to realize it, that is the optimal complex-valued precoding
matrix $W \in \mathbb{C}^{M \times K}$ to apply to UEs' data.

For the partial data sharing case (imposed by backhaul or
complexity limitations) each $\{m, k\}$ element of the precoding
matrix is actually a semi-continuous complex quantity, since
it could also be zero when the corresponding AP $m$ does not
have access to the data of UE $k$, that is, when the specific
AP-UE pair is not active. Therefore, as in [19], we also define an
auxiliary binary-valued AP-UE pairing matrix
$a \in \{0, 1\}^{M \times K}$, for which the arbitrary $a_{mk}$
element is 1 if the $m$-th AP and the $k$-th UE are paired, and 0
otherwise. The matrix $a$ can be seen as a
virtual connectivity matrix, and an all-ones matrix corresponds
to the full data sharing scenario. At the other extreme, each
column of $a$ has exactly one non-zero element, in order for
each UE to be served by at least one AP and assigned a non-zero
data rate, due to the fairness-based QoS profile.

[Figure 1 legend: UEs' data/precoder backhaul links; CSI reporting
over-the-air links; pairings of APs with UEs. $h_k$ = CSI vector of
UE $k$ to all APs; $w_m$ = precoding row vector of AP $m$; $a_m$ =
AP $m$-UEs pairing vector (e.g. $a_m = [1\ 1\ \ldots\ 1]$). Goal:
determine the optimal $W = [w_1; w_2; \ldots; w_M]$ and
$a = [a_1; a_2; \ldots; a_M]$ for achieving the maximum common
SINR $t^*$ per UE.]

Fig. 1. System Model

Perfect network CSI (measured and reported by the UEs) is
assumed, which corresponds to the normalized complex channel
matrix $H \in \mathbb{C}^{K \times M}$ and includes path-loss, shadowing,
small-scale fading as well as the effect of noise. The latter is
applied by normalizing the matrix with the noise amplitude
$\sqrt{N_0 \Delta f}$, where $N_0$ is the noise power density and
$\Delta f$ the resource unit transmission bandwidth. The matrix $H$
is structured by the per-UE row vectors $h_k \in \mathbb{C}^{1 \times M}$,
and accordingly the precoder matrix $W$ by stacking up the per-UE
column vectors $w_k$ or equivalently the per-AP row vectors $w_m$.
The offered per-UE QoS is expressed through the achieved
SINR $\gamma_k$, which, with the noise power normalized to one, is
simply given by

$$\gamma_k = \frac{\|h_k w_k\|^2}{1 + \sum_{j \neq k} \|h_k w_j\|^2}.$$

System operation is also characterized by a maximum power
budget $P_m^{\max}$ on the transmission (Tx) power per AP $P_m$,
where $P_m = \sum_k \|w_{mk}\|^2$. In addition, an upper bound on
the overall or on the per-UE number of pairings or
"virtual connections" among APs and UEs is imposed. For the
former case the number of pairings $\sum_k \sum_m a_{mk}$ is simply
upper bounded by an integer number $b_{tot}$, while for the latter
each UE's number of pairings $\sum_m a_{mk}$ is upper bounded by
$b_{tot}/K$. An illustration of the considered system model is given
in Figure 1, where an AP-UE pairing example is also given.
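The quantities of this section can be evaluated numerically for any candidate precoder. The following is a minimal sketch, assuming the noise-normalized channel (unit noise power) described above and plain nested lists for $H$ ($K \times M$) and $W$ ($M \times K$); the function names and the small 2x2 example are illustrative assumptions and do not come from the paper.

```python
def sinr_per_ue(H, W):
    """SINR of every UE for a noise-normalized channel H (K x M) and precoder W (M x K)."""
    K, M = len(H), len(W)
    sinrs = []
    for k in range(K):
        # gains[j] = |h_k w_j|^2: power of stream j as received by UE k
        gains = [abs(sum(H[k][m] * W[m][j] for m in range(M))) ** 2
                 for j in range(K)]
        interference = sum(gains) - gains[k]
        sinrs.append(gains[k] / (1.0 + interference))  # unit noise power
    return sinrs


def ap_tx_power(W):
    """Per-AP Tx power P_m = sum_k |w_mk|^2 (one value per row of W)."""
    return [sum(abs(w) ** 2 for w in row) for row in W]


def pairing_matrix(W, tol=1e-12):
    """Binary pairing matrix a implied by the non-zero entries of W."""
    return [[1 if abs(w) > tol else 0 for w in row] for row in W]


def satisfies_sharing_limits(a, b_tot, per_ue=False):
    """Check the overall (sum <= b_tot) or per-UE (<= b_tot / K) pairing limit.

    Every UE must also keep at least one serving AP."""
    M, K = len(a), len(a[0])
    per_ue_counts = [sum(a[m][k] for m in range(M)) for k in range(K)]
    if min(per_ue_counts) < 1:
        return False
    if per_ue:
        return all(c <= b_tot / K for c in per_ue_counts)
    return sum(per_ue_counts) <= b_tot


# Illustrative 2x2 example: each AP serves only its own UE.
H = [[2, 0.5j], [0.5, 2j]]
W = [[1, 0], [0, 1]]
gammas = sinr_per_ue(H, W)  # each UE sees 4 / (1 + 0.25) = 3.2
```

In this example both UEs obtain the same SINR, so the common SINR equals the minimum over UEs; in general the fairness objective is driven by `min(gammas)`.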

III. Optimal Solution 

We first cast the particular QoS provision scenario as an
optimization problem and provide a method to obtain the
exact maximum common rate per UE along with the optimal
AP-UE pairing and linear precoding solutions. For this
purpose we follow the work of [19]; however, our approach
differs from it in three ways. First, we
consider the fairness problem, while in [19] sum-power
minimization for given SINR constraints was examined.
Second, the formulation of [19] included a regularization
parameter which is unknown, while our model is related only
to the actual system parameters. Third, we introduce new data
sharing constraints (per-UE), which are of practical interest to
the system designer. The mathematical programming formulation
is given in Eq. (1).



$$\max \; t \quad \text{subject to constraints (1b)-(1e)} \tag{1}$$

By simple inspection, this problem is a Mixed Integer
Nonlinear Programming model, which is extremely hard to
solve [22]. However, for a specific given minimum SINR
target $t$, and after some manipulations as in [5], [15],
each (1b) constraint can be rewritten as two expressions
which are conic (a norm on the left-hand side lower than or equal
to an affine right-hand side [23]) and linear, respectively.
Constraints (1c), (1d) are also conic and (1e) are linear ones;
therefore, for a given SINR target, the problem is a
Mixed-Integer Second-Order Conic Program (MI-SOCP), which can
be exactly solved by modern solvers like MOSEK [24] and
CPLEX [25]. The respective formulation is given in Eq. (2).

$$\text{find } W, a \quad \text{s.t.} \quad
\left\| \begin{bmatrix} 1 \\ (h_k W)^T \end{bmatrix} \right\|
\le \sqrt{1 + \frac{1}{t}}\; \Re\{h_k w_k\}, \quad
\Im\{h_k w_k\} = 0, \ \forall k, \quad
\text{and (1c), (1d), (1e)} \tag{2}$$

To locate the maximum common SINR,
we can apply a bisection search [26], [15], where at each
step the particular feasibility MI-SOCP problem is solved
(equivalently, instead of feasibility problems, we may set the
(1e) expression as the objective function as well). Therefore,
the exact optimal solution can be obtained by this iterative
MI-SOCP procedure, which is given in pseudocode format in
Alg. 1 for convenience. Finally, note that the left (1e) constraint
expressions represent the most flexible pairing allocation,
while the rightmost ones represent the per-UE data sharing
limitations, so the achieved QoS level for the former case is an
upper bound for the latter or any other data sharing constraint.

IV. A Faster Approximation Algorithm

A. Motivation

Although the problem has been formulated as an attractive
mathematical program, the complexity remains extremely
high, due to the integer optimization variables needed
for defining the 0-1 pairing matrix. It is known that the
worst-case solution complexity of such problems scales
exponentially with the problem size and the time to solve them
cannot be bounded (see for example [27]). Therefore the exact
solution approach is realistic only for small-scale networks. In
order to deal with larger problem dimensions we cannot rely on this
model; we need a simpler, lower-complexity algorithm that still
achieves high performance levels.

Algorithm 1 Bisection Search for Maximum Common SINR

Given $t_l \le t^* \le t_u$, convergence tolerance $\epsilon$
repeat
    Set $t = (t_l + t_u)/2$;
    Solve the MI-SOCP feasibility problem (2);
    If a feasible solution is found $t_l \leftarrow t$, else $t_u \leftarrow t$
until $|t_u - t_l| < \epsilon$.
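The bisection search of Alg. 1 can be sketched independently of the solver. In the sketch below, `is_feasible` is a hypothetical callable standing in for the feasibility problem solved by a conic solver at each step, and `toy_oracle` is an illustrative stand-in (two symmetric interfering links with a closed-form SINR), not the MI-SOCP of the paper. The only property relied upon is that feasibility is monotone in the SINR target.

```python
def bisection_max_common_sinr(is_feasible, t_low, t_high, eps):
    """Largest target t in [t_low, t_high], within eps, for which is_feasible(t) holds.

    Assumes monotonicity: if a target is feasible, every smaller target is too.
    Returns the final lower bound and the number of feasibility problems solved."""
    iterations = 0
    while t_high - t_low >= eps:
        t = 0.5 * (t_low + t_high)
        if is_feasible(t):
            t_low = t    # target achievable: raise the lower bound
        else:
            t_high = t   # target not achievable: lower the upper bound
        iterations += 1
    return t_low, iterations


def toy_oracle(t, gain=4.0, cross=0.25, p_max=1.0):
    """Illustrative feasibility test (assumption, not the paper's problem).

    Two symmetric links transmit with the same power p <= p_max and see
    SINR(p) = gain * p / (1 + cross * p), which increases with p, so a target
    t is achievable exactly when SINR(p_max) >= t."""
    return gain * p_max / (1.0 + cross * p_max) >= t


t_star, n_solves = bisection_max_common_sinr(toy_oracle, 0.0, 1e4, 0.01)
```

With the toy oracle the true optimum is 3.2, and the returned lower bound lies within the tolerance below it. Replacing `toy_oracle` with a call to a conic solver gives the structure of the exact procedure, at the cost of one (MI-)SOCP solve per iteration.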

B. Algorithm Description and Complexity 

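The complexity burden of the original formulation lies
in the binary pairing matrix. For a given pairing matrix $a$,
the problem solved at each iteration of the bisection search
becomes a simple SOCP (see Eq. (3)), which is convex and can
be solved exactly and efficiently (in terms of execution time)
for large dimensions through available convex optimization
solvers like SDPT3 [28].

$$\begin{aligned}
\text{find } W \quad \text{s.t.} \quad
& \left\| \begin{bmatrix} 1 \\ (h_k W)^T \end{bmatrix} \right\|
  \le \sqrt{1 + \frac{1}{t}}\; \Re\{h_k w_k\}, \quad
  \Im\{h_k w_k\} = 0, \ \forall k, \\
& \|w_m\| \le \sqrt{P_m^{\max}}, \ \forall m, \\
& w_{mk} = 0, \ \forall m, k \text{ having } a_{mk} = 0
\end{aligned} \tag{3}$$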
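The step from the per-UE SINR requirement to a conic constraint can be spelled out as follows. This is a sketch of the standard argument, assuming the noise-normalized SINR expression of Section II (unit noise power), a target $t > 0$, and the usual convention that $w_k$ is rotated by a common phase so that $h_k w_k$ is real and non-negative (such a rotation leaves every SINR unchanged).

```latex
\begin{align*}
\frac{|h_k w_k|^2}{1 + \sum_{j \neq k} |h_k w_j|^2} \ge t
&\iff \frac{1}{t}\,|h_k w_k|^2 \ge 1 + \sum_{j \neq k} |h_k w_j|^2 \\
&\iff \left(1 + \frac{1}{t}\right) |h_k w_k|^2 \ge 1 + \sum_{j=1}^{K} |h_k w_j|^2
      = \left\| \begin{bmatrix} 1 & h_k W \end{bmatrix} \right\|^2 \\
&\iff \sqrt{1 + \frac{1}{t}}\; \Re\{h_k w_k\} \ge
      \left\| \begin{bmatrix} 1 & h_k W \end{bmatrix} \right\|,
      \qquad \Im\{h_k w_k\} = 0 .
\end{align*}
```

The last line has a norm on one side and an affine function of $W$ on the other, which is the second-order cone form, while the phase condition is linear in $W$.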
Based on this observation, we propose a "greedy-like" procedure
to approach the data sharing constrained pairing matrix,
in which at each step a series (due to the bisection search)
of conic feasibility or optimization problems is solved; hence
the pairing matrix is approached from the exterior of the
feasible space [29]. In particular, we begin with an all-ones
pairing matrix (which corresponds to the full data sharing case
or, in other words, to the relaxation of the (1e) constraints)
and gradually (one by one) zero specific elements of it, until
a matrix fulfilling the (1e) constraint(s) is acquired. The
suboptimality of this approach stems from the selection of
the element to zero at each step of the greedy algorithm
(since a heuristic criterion is applied) as well as from the gradual
(rather than "single-shot") selection procedure. The advantage
of this approach is that it has bounded and predictable
computational complexity and is robust, since it is based on
a simple heuristic zeroing criterion.

As for the selection of which AP-UE pairing to exclude
from the solution at each step, we first explain the
effect of zeroing an arbitrary $w_{mk}$ element. Based on the
expression for the SINR per UE given in Eq. (1b), we observe
that both the received power for the particular UE and the
interference power to all other UEs are affected. Therefore, in
order to consider both quantities and balance both effects, we
define our selection criterion as the ratio of the new received
power to the interference power and select the combination
which maximizes this ratio. The complete algorithmic steps
are given below.
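The selection criterion just described can be sketched as follows. Two points are assumptions made for this illustration rather than statements of the paper: the interference term is taken as the total interference received by all UEs other than $k$ after the candidate element is zeroed, and a pairing is skipped if it is the last active AP of a UE (so that every UE keeps at least one serving AP, as required in Section II). Function names are hypothetical.

```python
def stream_gains(H, W):
    """gains[i][j] = |h_i w_j|^2: power of stream j as received by UE i."""
    K, M = len(H), len(W)
    return [[abs(sum(H[i][m] * W[m][j] for m in range(M))) ** 2
             for j in range(K)] for i in range(K)]


def select_pair_to_zero(H, W, tol=1e-12):
    """Return the active (m, k) whose zeroing maximizes RP_mk / I_mk, and that ratio."""
    K, M = len(H), len(W)
    best_pair, best_ratio = None, -1.0
    for m in range(M):
        for k in range(K):
            if abs(W[m][k]) <= tol:
                continue  # pairing already inactive
            active_aps = sum(1 for r in range(M) if abs(W[r][k]) > tol)
            if active_aps <= 1:
                continue  # assumption: never remove the last AP of a UE
            trial = [row[:] for row in W]
            trial[m][k] = 0.0  # candidate precoder with w_mk zeroed
            g = stream_gains(H, trial)
            received = g[k][k]  # RP_mk: useful power left for UE k
            # I_mk (assumed form): interference seen by every UE other than k
            interference = sum(g[i][j] for i in range(K) if i != k
                               for j in range(K) if j != i)
            ratio = received / interference if interference > 0 else float("inf")
            if ratio > best_ratio:
                best_pair, best_ratio = (m, k), ratio
    return best_pair, best_ratio


# Illustrative 2x2 example with full data sharing.
H = [[2, 0.5], [0.25, 2]]
W = [[0.9, 0.1], [0.1, 0.9]]
pair, ratio = select_pair_to_zero(H, W)
```

In the full algorithm the precoder would be re-optimized by a new bisection search after each removal; the sketch covers only the choice of which element to zero.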



Algorithm 2 Greedy-like Approximation Algorithm

Step 1 (Initial Full Data Sharing Solution)

Run bisection search on Problem (3) for full data sharing
conditions and acquire an initial optimal precoder.

Step 2 (Greedy Selection of Pairing Matrix Element to Zero)

Based on the current precoding matrix, calculate for each
possible AP-UE $\{m, k\}$ combination the following two
quantities if this element is zeroed, that is, if $w_{mk} = 0$:

i) Received power for the $k$-th UE: $RP_{mk} = |h_k w_k|^2$

ii) Interference power to the rest of the UEs:
$I_{mk} = \sum_{i \neq k} \sum_{j \neq i} \|h_i w_j\|^2$

Then, select the combination $\{m^*, k^*\}$ that maximizes
the ratio $RP_{mk}/I_{mk}$, and re-run bisection search for
Problem (3) with the selected pairing matrix element forced
to be zero (in addition to previously zeroed elements) and
acquire a new precoder matrix.

Step 3 (Data Sharing Constraint Check)

Check if the number of non-zero elements of the pairing (or
equivalently precoder) matrix exceeds the (1e) data sharing
constraint. If yes go to Step 2, else terminate the algorithm.

As for the estimation of the worst-case computational
complexity order of the proposed greedy-like algorithm, we
first notice that there are two iteration levels: an outer one for
greedily reducing the number of AP-UE pairings and an inner one
for reoptimizing the precoder matrix. The outer
procedure reduces the AP-UE pairings from $MK$ at the
initialization phase to the required number set by the data
sharing constraint, $b_{tot}$; hence for the strictest limitation
case it requires $MK - b_{tot} = MK - K \approx MK$ iterations. For
the inner procedure, first the best AP-UE pairing is selected
among the $MK$ possible combinations, with complexity $MK$.
Then, a bisection search for the common SINR optimization
is employed, which needs exactly
$\lceil \log_2 [(t_u - t_l)/\epsilon] \rceil$ [26, Sec. 4.2.5]
iterations (for the typical parameter selection given
in Section V, there are 20 iterations per bisection search).
Finally, at each iteration of the bisection search an SOCP is
solved. According to [23], for a second order conic program
with $n$ variables and $i$ conic constraints of dimension $m$
each, complexity scales as $O\left(n \sum_i m\right)$. Our formulation
given in (3) involves $MK$ variables (precoding matrix elements),
$K$ conic constraints of dimension $K$ each for guaranteeing
the minimum SINR per UE, and $M$ conic constraints of
dimension $K$ each for bounding the Tx power per AP. Hence,
the SOCP sub-step scales as
$O(MK \cdot (K \cdot K + M \cdot K)) = O(MK \cdot (K^2 + MK)) \approx O((MK)^2)$.
Combining the above calculations and after some manipulations,
we get a 3rd-order polynomial $O((MK)^3)$ complexity order with
respect to the network size, contrary to the exponential $2^{MK}$
of the exact optimal approach.^1

V. Results and Discussion

A. Simulation Setup

A series of Monte-Carlo simulation experiments is performed
in order to gain insight into the optimal AP-UE
pairing patterns for the two data sharing limitation strategies
(overall/per-UE) and the achieved performance levels of the
faster greedy-like algorithm. Each computational experiment
simulates a distributed wireless access network comprising
$M = 6$ APs and $K = 6$ UEs (non-symmetrical MIMO networks
are also fully supported by our framework). Each transmission
occupies a single resource unit of 200 kHz. Network
MIMO channel matrices are generated based on the WINNER II
model [30] and simulate both large-scale (path-loss, shadowing)
and small-scale (fading) effects, while thermal noise with
density $N_0 = -174$ dBm/Hz corrupts received signals. Each
AP power is upper bounded by $P^{\max} = 1$ W, and APs/UEs are
randomly dropped over a square area with side length
1000 or 250 m. The reference SNR per link (averaged over all
AP-UE combinations and computed assuming uniform power
allocation per UE and maximum power per AP) is 16 dB for
the sparser and 26 dB for the denser setup.
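To make the noise normalization of Section II concrete with the parameters above, the following sketch converts the noise density and bandwidth into a noise power and scales a raw channel gain accordingly. The 120 dB path loss used in the example is a hypothetical value chosen only for illustration, and the function names are not from the paper.

```python
import math


def noise_power_watts(n0_dbm_per_hz, bandwidth_hz):
    """Noise power N0 * delta_f in watts, from a density given in dBm/Hz."""
    n0_watts_per_hz = 10 ** ((n0_dbm_per_hz - 30) / 10)  # dBm/Hz -> W/Hz
    return n0_watts_per_hz * bandwidth_hz


def normalize_gain(raw_amplitude_gain, noise_power_w):
    """Divide a raw channel amplitude gain by the noise amplitude sqrt(N0 * delta_f)."""
    return raw_amplitude_gain / math.sqrt(noise_power_w)


def to_db(power_ratio):
    return 10 * math.log10(power_ratio)


# Parameters quoted in the text: -174 dBm/Hz over a 200 kHz resource unit.
p_noise = noise_power_watts(-174.0, 200e3)
p_noise_dbm = to_db(p_noise / 1e-3)       # about -121 dBm

# Hypothetical link with 120 dB path loss (amplitude gain 1e-6), 1 W Tx power.
h = normalize_gain(1e-6, p_noise)
snr_db = to_db(abs(h) ** 2 * 1.0)         # about 31 dB
```

After this normalization, the squared magnitude of a channel entry multiplied by the Tx power directly gives the link SNR, which is why the noise term in the SINR expression reduces to one.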

Regarding the bisection search, we bound the target SINR as
$t_l = 0$, $t_u = 10^4$ and set the termination tolerance to
$\epsilon = 0.01$. As for the optimization problems, we use CVX [31]
to model the optimization formulations in MATLAB along with
the SDPT3 [28] and MOSEK [24] packages for obtaining the
solution of conic and mixed integer conic instances. For
the mixed-integer problems we set an execution time upper bound
of 900 s. The network rate is calculated through a PHY abstraction
(see [21, Sec. 10.4]), is evenly divided among the UEs due to the
fair QoS profiling, and is averaged over 20 independent setups.
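The iteration counts implied by these settings can be checked with a few lines. The sketch assumes an upper SINR bound of $10^4$, which is the value that reproduces the 20 bisection iterations quoted in Section IV for $t_l = 0$ and $\epsilon = 0.01$; the pairing limit `b_tot = 12` is a hypothetical value used only for the example.

```python
import math


def bisection_iterations(t_low, t_high, eps):
    """Number of bisection steps: ceil(log2((t_high - t_low) / eps))."""
    return math.ceil(math.log2((t_high - t_low) / eps))


def greedy_socp_solves(M, K, b_tot, iters_per_bisection):
    """SOCP solves of the greedy-like algorithm.

    One bisection run for the full data sharing start, plus one run for each
    of the M*K - b_tot pairings that are removed one by one."""
    bisection_runs = 1 + (M * K - b_tot)
    return bisection_runs * iters_per_bisection


iters = bisection_iterations(0.0, 1e4, 0.01)            # 20
total = greedy_socp_solves(6, 6, 12, iters)             # (1 + 24) * 20 = 500
```

For the 6x6 network of this section, the greedy-like algorithm therefore needs a few hundred convex SOCP solves, a number known in advance, in contrast to the mixed-integer runs whose duration has to be capped.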

B. Simulation Results & Comments 

We first summarize the implemented schemes: 

• Optimal Joint AP-UE Pairing and Linear Precoding with
overall ("OPT") and per-UE ("OPT-perUE") data sharing
constraints (Alg. 1, Section III).

• The proposed faster greedy-like approximation algorithm
("APPROX"), refer to Alg. 2, Section IV.

• Two disjoint 2-step AP-UE pairing & linear precoding
baseline "user-clustering" algorithms ("CLUST-1/2"). At
the 1st step each UE chooses its best $b_{tot}/K$ APs to
be served based on either a "maximum channel gain"
or "maximum interfering UEs" criterion (see [20] for a
description of the corresponding approaches) and then
independent optimal linear precoding is performed.

• A scheme based on random AP-UE pairing association
combined with a subsequent optimal linear precoding
("RANDOM").

^1 Note that an exhaustive search approach is also unaffordable due to the
excessive number of AP-UE pairing combinations. For example for a 5x5
network there exist 155 potential AP-UE combinations, and for each one the
optimal linear precoding should be calculated as well.

[Figure 2: (a) Data Sharing Limitation Scenario #1; (b) Data Sharing
Limitation Scenario #2]

Fig. 2. Optimum AP-UE pairings for a random network snapshot and two
data sharing limitation scenarios



In Figure 2 the optimal active AP-UE pairings acquired 
through Algorithm 1 are graphically illustrated for two data 
sharing constraint scenarios (a "lighter" and a "tighter"). It 
is observed that the operation mode for the whole network 
is a mix of multi-user MIMO, MISO and SISO transmission
strategies, and that for tighter limitations (see Figure 2(b)) the 
network is decoupled (partitioned) into smaller subnetworks. 
Also, on a per-UE basis, an unequal number of serving APs is 
assigned, demonstrating that the balanced clustering approach 
is suboptimal. 

Figure 3 depicts the achieved (common) rate
performance of the examined schemes for two different
network density setups. We first remark (by comparing
"OPT-perUE" with "OPT") that when we impose data sharing
constraints on a per-UE basis, namely when we require that all
the UEs have the same number of serving APs, a
performance penalty is induced. In other words, the ideal
AP-UE association suggests an unbalanced pairing for the UEs
(refer to the previous empirical observations as well). In fact,
UEs which were very close to APs were allocated a smaller
number of APs with low Tx power, contrary to other UEs
that did not have a high quality AP nearby and were
allocated a larger number of APs. The suboptimality level of
the "perUE" limitation scenario goes up to 27% for the sparser
setup and 46% for the denser one.

Secondly, the two baseline clustering algorithms ("CLUST-1/2")
experience a large optimality gap from their upper bound
("OPT-perUE"), since they are based on disjoint AP-UE pairing
and linear precoding. Therefore, a joint pairing-precoding
optimization is necessary for achieving high performance levels.
Thirdly, the proposed greedy-like scheme exhibits acceptable
performance with reasonable complexity. It even outperforms
(or is very close to) the "perUE" upper bound, since it does
not perform balanced pairing among UEs, but applies a very
flexible approach at each step of locating the pairs to
remove. The proposed algorithm achieves on average 90%
(Fig. 3(a)) and 80% (Fig. 3(b)), and in the worst-case scenario
78% (Fig. 3(a)) and 65% (Fig. 3(b)), of the optimal network
rate for the different network density setups. We believe that
for denser network setups and more challenging data sharing
limitations, the performance of the proposed greedy algorithm
deteriorates due to the large number of necessary heuristic searches
($MK - b_{tot}$) and the increase in experienced interference
levels, which are difficult to handle with our simple pairing
removal criterion. The development of other greedy schemes
with more sophisticated criteria is left for future work; however,
the particular scheme already achieves to a sufficient extent the
merits of joint pairing and precoding optimization, contrary to
the disjoint baseline clustering schemes.

VI. Conclusion & Future Work 

In this paper, the problem of jointly designing the pairing of
access points with users and the linear precoding for distributed
MIMO networks was thoroughly examined. Solutions for networks
with finite data sharing capabilities and fairly assigned
QoS levels across the users were proposed. First, a method
for obtaining the optimal D-MIMO design was devised,
and second, a faster approximation algorithm was proposed, with
significantly lower complexity but still quite high QoS
performance, which was also shown to outperform existing
user-clustering schemes. The importance of jointly considering the
pairing and the precoding sub-procedures in D-MIMO design was
demonstrated. Future work includes the improvement of the
proposed approximation algorithm, the consideration of other
mixed QoS models, and the application of the above concepts
in very large ultra-dense access networks.